Generalized entropies through Bayesian estimation

نویسندگان

  • Dirk Holste
  • Hanspeter Herzel
چکیده

The demand made upon computational analysis of observed symbolic sequences has been increasing in the last decade. Here, the concept of entropy receives applications, and the generalizations according to Tsallis H (T) q and R enyi H (R) q provide whole-spectra of entropies characterized by an order q. An enduring practical problem lies in the estimation of these entropies from observed data. The nite size of data sets can lead to serious systematic and statistical estimation errors. We focus on the problem of estimating generalized entropies from limited data samples and derive a Bayesian estimator of the Tsallis entropy, H (T) q , including the (q = 1) Shannon entropy. By extending our previous results on statistical entropy estimation of symbol sequences 12], we use a prior distribution over the probabilities which is of Dirichlet-type. Using the relationship between H (T) q and H (R) q , we utilize the Bayesian entropy estimator H (T) q to estimate the R enyi entropy H (R) q from observed data. The Bayesian estimator yields the smallest mean-squared deviation from the true parameter as compared with any other estimator. We compare the Bayesian entropy estimators with the frequency-count estimators of H (T) q and H (R) q. Numerical simulations reveal that the Bayesian entropy estimator reduces statistical estimation errors of generalized entropies for statistical processes such as generated by higher-order Markov models.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparison of Maximum Likelihood Estimation and Bayesian with Generalized Gibbs Sampling for Ordinal Regression Analysis of Ovarian Hyperstimulation Syndrome

Background and Objectives: Analysis of ordinal data outcomes could lead to bias estimates and large variance in sparse one. The objective of this study is to compare parameter estimates of an ordinal regression model under maximum likelihood and Bayesian framework with generalized Gibbs sampling. The models were used to analyze ovarian hyperstimulation syndrome data.   Methods: This study use...

متن کامل

Generalized information criteria for optimal Bayes decisions

This paper deals with Bayesian models given by statistical experiments and standard loss functions. Bayes probability of error and Bayes risk are estimated by means of classical and generalized information criteria applicable to the experiment. The accuracy of the estimation is studied. Among the information criteria studied in the paper is the class of posterior power entropies which includes ...

متن کامل

Structure Inference of Bayesian Networks from Data: A New Approach Based on Generalized Conditional Entropy

We propose a novel algorithm for extracting the structure of a Bayesian network from a dataset. Our approach is based on generalized conditional entropies, a parametric family of entropies that extends the usual Shannon conditional entropy. Our results indicate that with an appropriate choice of a generalized conditional entropy we obtain Bayesian networks that have superior scores compared to ...

متن کامل

Bayesian Inference for Spatial Beta Generalized Linear Mixed Models

In some applications, the response variable assumes values in the unit interval. The standard linear regression model is not appropriate for modelling this type of data because the normality assumption is not met. Alternatively, the beta regression model has been introduced to analyze such observations. A beta distribution represents a flexible density family on (0, 1) interval that covers symm...

متن کامل

Hyperbolic Cosine Log-Logistic Distribution and Estimation of Its Parameters by Using Maximum Likelihood Bayesian and Bootstrap Methods

‎In this paper‎, ‎a new probability distribution‎, ‎based on the family of hyperbolic cosine distributions is proposed and its various statistical and reliability characteristics are investigated‎. ‎The new category of HCF distributions is obtained by combining a baseline F distribution with the hyperbolic cosine function‎. ‎Based on the base log-logistics distribution‎, ‎we introduce a new di...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998